General information



Entry name         ATPG_HAEIN
Accession number   P43716
Created            Rel. 32, 1-NOV-1995
Sequence update    Rel. 32, 1-NOV-1995
Annotation update  Rel. 40, 16-OCT-2001



Description and origin of the Protein 



Description   ATP SYNTHASE GAMMA CHAIN (EC 3.6.3.14).
Gene name(s)  ATPG OR HI0480.
From          Haemophilus influenzae.
Taxonomy      Bacteria; Proteobacteria; gamma subdivision;
              Pasteurellaceae; Haemophilus.
NCBI TaxID    727



References


[1] Fleischmann,R.D., Adams,M.D., White,O., Clayton,R.A.,
Kirkness,E.F., Kerlavage,A.R., Bult,C.J., Tomb,J.-F.,
Dougherty,B.A., Merrick,J.M., McKenney,K., Sutton,G.,
Fitzhugh,W., Fields,C.A., Gocayne,J.D., Scott,J.D.,
Shirley,R., Liu,L.-I., Glodek,A., Kelley,J.M., Weidman,J.F.,
Phillips,C.A., Spriggs,T., Hedblom,E., Cotton,M.D.,
Utterback,T.R., Hanna,M.C., Nguyen,D.T., Saudek,D.M.,
Brandon,R.C., Fine,L.D., Fritchman,J.L., Fuhrmann,J.L.,
Geoghagen,N.S.M., Gnehm,C.L., McDonald,L.A., Small,K.V.,
Fraser,C.M., Smith,H.O., Venter,J.C.,
Whole-genome random sequencing and assembly of
Haemophilus influenzae Rd.
(1995) Science 269:496-512

Position  SEQUENCE FROM N.A.
Strain    STRAIN=RD / KW20 / ATCC 51907;
Medline   95350630
PubMed    7542800



Comments



FUNCTION: PRODUCES ATP FROM ADP IN THE PRESENCE OF A PROTON GRADIENT
ACROSS THE MEMBRANE. THE GAMMA CHAIN IS BELIEVED TO BE IMPORTANT IN
REGULATING ATPASE ACTIVITY AND THE FLOW OF PROTONS THROUGH THE CF(0)
COMPLEX.

SUBUNIT: F-TYPE ATPASES HAVE 2 COMPONENTS, CF(1) - THE CATALYTIC CORE -
AND CF(0) - THE MEMBRANE PROTON CHANNEL. CF(1) HAS FIVE SUBUNITS:
ALPHA(3), BETA(3), GAMMA(1), DELTA(1), EPSILON(1). CF(0) HAS THREE MAIN
SUBUNITS: A, B AND C.

SIMILARITY: BELONGS TO THE ATPASE GAMMA CHAIN FAMILY.



Copyright 



This SWISS-PROT entry is copyright. It is produced through a
collaboration between the Swiss Institute of Bioinformatics and the EMBL 
outstation - the European Bioinformatics Institute. There are no 
restrictions on its use by non-profit institutions as long as its content is in 
no way modified and this statement is not removed. Usage by and for 
commercial entities requires a license agreement (See 
http://www.isb-sib.ch/announce/ or send an email to license@isb-sib.ch). 
Database cross-references



EMBL      U32730; AAC22138.1; -.
HSSP      P05631; 1BMF.
TIGR      HI0480; -.
InterPro  IPR000131; ATPase_gamma.
Pfam      PF00231; ATP-synt; 1.
PRINTS    PR00126; ATPASEGAMMA.
PROSITE   PS00153; ATPASE_GAMMA; 1.



Keywords 



ATP synthesis; CF(1); Hydrogen ion transport; Hydrolase; Complete
proteome.



Sequence information 



MAGAKEIKTK IASVQSTQKI TKAMEMVATS KMRKTQDRMA ASRPYSETIR NVISHVSKAS
IGYKHPFLVE REVKKIGILV ISTDRGMCGG LNVNLFKTTL NQIKNWKEQN ISTDLGLIGS
KGISFFRSFG FNIKGQLSGL GDTPALEELI GVANTMFDAY RNGEIDAVYI AYNKFVNTMS
QKPVVQQLVP LPESKDDHLN ERQQTWDYLY EPEPKVLLDS LLVRYLESQI YQAVVDNVAS
EQAARMVAMK AATDNAGNLI NDLRLVYNKA RQASITNELN EIVAGAAAI
//
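
As a rough consistency check on the sequence above, the following Python sketch compares its residue count with the length implied by the coding region `complement(8741..9610)` that EMBL entry U32730 lists for HI0480. The function names are hypothetical, and the two `VV` pairs in the fourth row are an assumption: the scan shows nine-character blocks containing `W` at those positions, which is read here as a misrecognised `VV`.

```python
import re

def cds_length(location: str) -> int:
    """Nucleotide length of a simple EMBL location such as
    'complement(8741..9610)'. Single spans only; joins are not handled."""
    match = re.search(r"(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unsupported location: {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

def expected_residues(nt_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nt_length // 3 - 1

# Rows as printed in ten-residue blocks; the spaces are layout only.
# Assumption: 'QKPVVQQLVP' and 'YQAVVDNVAS' restore 'VV' where the scan has 'W'.
ROWS = [
    "MAGAKEIKTK IASVQSTQKI TKAMEMVATS KMRKTQDRMA ASRPYSETIR NVISHVSKAS",
    "IGYKHPFLVE REVKKIGILV ISTDRGMCGG LNVNLFKTTL NQIKNWKEQN ISTDLGLIGS",
    "KGISFFRSFG FNIKGQLSGL GDTPALEELI GVANTMFDAY RNGEIDAVYI AYNKFVNTMS",
    "QKPVVQQLVP LPESKDDHLN ERQQTWDYLY EPEPKVLLDS LLVRYLESQI YQAVVDNVAS",
    "EQAARMVAMK AATDNAGNLI NDLRLVYNKA RQASITNELN EIVAGAAAI",
]
protein = "".join("".join(row.split()) for row in ROWS)

nt = cds_length("complement(8741..9610)")
print(nt, expected_residues(nt), len(protein))  # 870 289 289
```

With the two repaired blocks the protein length agrees with the 870-nucleotide coding region; with the nine-character blocks as scanned it would come out two residues short.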
ATP synthase gamma subunit

Accession           IPR000131; (ATPase_gamma) matches 100 proteins
Name                ATP synthase gamma subunit
Type                Domain
Signatures          PRINTS: PR00126 ATPASEGAMMA
                    PROSITE: PS00153 ATPASE_GAMMA
                    PFAM: PF00231 ATP-synt
Molecular Function  hydrogen-transporting two-sector ATPase (GO:0003936)
Biological Process  hydrogen transport (GO:0006818)
Cellular Component  membrane (GO:0016020)

Abstract

ATP synthase (proton-translocating ATPase) [EC: 3.6.1.34]
[1, 2] is a component of the cytoplasmic membrane of
eubacteria, the inner membrane of mitochondria, and the
thylakoid membrane of chloroplasts. The ATPase complex
is composed of an oligomeric transmembrane sector, called
CF(0), and a catalytic core, called coupling factor CF(1).
The former acts as a proton channel; the latter is
composed of five subunits, alpha, beta, gamma, delta and
epsilon. Subunit gamma is believed to be important in
regulating ATPase activity and the flow of protons
through the CF(0) complex. The best conserved region of
the gamma subunit [3] is its C-terminus, which seems to be
essential for assembly and catalysis.

Examples

P36542
P00837
P29790

1. Futai,M., Noumi,T., Maeda,M.,

ATP synthase (H+-ATPase) - Results by combined
1. Science. 1995, 269(5223):496-512

Whole-genome random sequencing and assembly of
Haemophilus influenzae Rd.

Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F.,
Kerlavage A.R., Bult C.J., Tomb J.F., Dougherty B.A., Merrick J.M.

Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA. 

An approach for genome analysis based on sequencing and assembly of 
unselected pieces of DNA from the whole chromosome has been applied 
to obtain the complete nucleotide sequence (1,830,137 base pairs) of the 
genome from the bacterium Haemophilus influenzae Rd. This approach 
eliminates the need for initial mapping efforts and is therefore applicable 
to the vast array of microbial species for which genome maps are 
unavailable. The H. influenzae Rd genome sequence (Genome Sequence 
DataBase accession number L42023) represents the only complete 
genome sequence from a free-living organism.

Medline ID: 95350630
PMID: 7542800 
ISSN: 0036-8075 

Publication Type: 
Journal Article 

Substances: 
Bacterial Proteins 
DNA, Bacterial 
RNA, Bacterial 
RNA, Ribosomal 
ID   AR089428 standard; DNA; UNC; 1374 BP.
XX
AC   AR089428;
XX
SV   AR089428.1
XX
DT   12-SEP-2000 (Rel. 65, Created)
DT   12-SEP-2000 (Rel. 65, Last updated, Version 1)
XX
DE   Sequence 187 from patent US 5994066.
XX
KW   .
XX
OS   unidentified
OC   unclassified.
XX
RN   [1]
RP   1-1374
RA   Bergeron M.G., Picard F.J., Ouellette M., Roy P.H.;
RT   "Species-specific and universal DNA probes and amplification primers to
RT   rapidly detect and identify common bacterial pathogens and associated
RT   antibiotic resistance genes from clinical specimens for routine diagnosis
RT   in microbiology laboratories";
RL   Patent number US5994066-A/187, 30-NOV-1999.
XX

FH   Key             Location/Qualifiers
FH
FT   source          1..1374
FT                   /db_xref="taxon:32644"
FT                   /organism="unidentified"
XX
SQ   Sequence 1374 BP; 425 A; 238 C; 312 G; 399 T; 0 other;
     atgtcagcag gaaaaattgt acaaatcatc ggtgcggtga ttgacgttga atttccacaa        60
     gatgcagtgc caaaagttta cgatgcatta aaagttgaat caggtttaac acttgaggtg       120
     caacaacaat taggtggcgg agtagttcgt tgtattgcat tgggtacatc tgatggttta       180
     aaacgtggct taaaagtaga aaacacaaat aacccaattc aagtaccagt aggcacaaaa       240
     acacttggtc gtattatgaa tgtgttgggc gaaccaattg acgaacaagg agcaattggt       300
     gaagaagagc gttgggctat ccaccgttcg gcaccgagct atgaagaaca atcaaacagt       360
     acggaattat tagaaactgg tatcaaagtg atcgacttaa tttgtccatt cgcaaaaggt       420
     ggtaaagttg gtttattcgg cggtgcaggt gtaggtaaaa ccgtaaatat gatggagttg       480
     attcgtaata ttgcgattga gcattcaggt tactctgtat ttgcgggtgt aggcgaacgt       540
     actcgtgaag gtaatgactt ctatcatgaa atgaaagatt ctaacgtatt agataaagta       600
     tctttggttt atggtcaaat gaacgagcca ccaggtaacc gtttacgtgt tgcattgact       660
     ggtttaacta tggctgaaaa attccgtgat gaaggtcgag atgtattatt ctttgtggat       720
     aatatctatc gttataccct tgctggtacg gaagtatctg cgttattagg tcgtatgcca       780
     tccgcggtag gttaccaacc gacattggca gaagaaatgg gtgtgttaca agaacgtatc       840
     acttcaacca aaacaggttc tatcacttct gtgcaagcgg tgtacgtacc agcggatgac       900
     ttaactgacc catctccagc aacaactttc gcacatttag actcaactgt tgtattaagt       960
     cgtcaaatcg catctttagg tatttaccct gcagttgatc cattagattc aacttcacgt      1020
     cagctagacc cgcttgttgt tggtcaagaa cattatgatg ttgctcgtgg tgtacaaggt      1080
     attttacaac gttataaaga attgaaagat attatcgcaa ttcttggtat ggatgaatta      1140
     tctgaagaag ataaactagt ggtagcacgt gcacgtaaaa ttgaacgttt cttatcacaa      1200
     ccattctttg ttgcagaagt cttcacaggt tcaccaggta aatacgtgac attaaaagac      1260
     accatccgtg gcttcaaagg tatcttagat ggcgaatatg accatattcc tgaacaagcg      1320
     ttctatatgg ttggttcaat cgatgaagtg ttagaaaaag ccaaaaatat gtaa            1374
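
Two of the records here can be cross-checked mechanically. The Python sketch below, with hypothetical helper names, tests that the base counts on an EMBL `SQ` line add up to the declared length, and that the last 30 bases of AR089428 are the reverse complement of positions 7351-7380 of U32730, which is where that entry places a coding region on the complementary strand (`complement(7351..8724)`). The two 30-base strings are copied from the sequence rows of the respective entries; nothing else about the relationship between the two records is assumed.

```python
import re

# Lower-case DNA alphabet, as used in the EMBL sequence rows.
COMPLEMENT = str.maketrans("acgt", "tgca")

def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the strand."""
    return seq.translate(COMPLEMENT)[::-1]

def sq_counts_consistent(sq_line: str) -> bool:
    """True if the A/C/G/T/other counts on an SQ line sum to the declared BP."""
    declared = int(re.search(r"Sequence (\d+) BP", sq_line).group(1))
    counts = [int(n) for n in re.findall(r"(\d+) (?:A|C|G|T|other);", sq_line)]
    return sum(counts) == declared

AR089428_SQ = "SQ   Sequence 1374 BP; 425 A; 238 C; 312 G; 399 T; 0 other;"
U32730_SQ = "SQ   Sequence 13828 BP; 4368 A; 2718 C; 2562 G; 4180 T; 0 other;"

# Last 30 bases of AR089428 (positions 1345-1374).
AR089428_TAIL = "gaagtgttagaaaaagccaaaaatatgtaa"
# Positions 7351-7380 of U32730 (last three blocks of the row ending at 7380).
U32730_7351_7380 = "ttacatatttttggctttttctaacacttc"

print(sq_counts_consistent(AR089428_SQ), sq_counts_consistent(U32730_SQ))
print(reverse_complement(AR089428_TAIL) == U32730_7351_7380)
```

A match over 30 bases only shows that the two ends line up; comparing the full 1374 bases would need the complete U32730 rows, some of which are missing from this copy.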
ID   HI32730 standard; DNA; PRO; 13828 BP.
XX
AC   U32730; L42023;
XX
SV   U32730.1
XX
DT   09-AUG-1995 (Rel. 44, Created)
DT   15-JUN-1998 (Rel. 56, Last updated, Version 9)
XX
DE   Haemophilus influenzae Rd section 45 of 163 of the complete genome.
XX
KW   .
XX
OS   Haemophilus influenzae Rd
OC   Bacteria; Proteobacteria; gamma subdivision; Pasteurellaceae; Haemophilus;
OC   Haemophilus influenzae.
XX
RN   [1]
RP   1-13828
RX   MEDLINE; 95350630.
RA   Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F.,
RA   Kerlavage A.R., Bult C.J., Tomb J., Dougherty B.A., Merrick J.M.,
RA   McKenney K., Sutton G.G., FitzHugh W., Fields C.A., Gocayne J.D.,
RA   Scott J.D., Shirley R., Liu L.I., Glodek A., Kelley J.M., Weidman J.F.,
RA   Phillips C.A., Spriggs T., Hedblom E., Cotton M.D., Utterback T.,
RA   Hanna M.C., Nguyen D.T., Saudek D.M., Brandon R.C., Fine L.D.,
RA   Fritchman J.L., Fuhrmann J.L., Geoghagen N.S., Gnehm C.L., McDonald L.A.,
RA   Small K.V., Fraser C.M., Smith H.O., Venter J.C.;
RT   "Whole-genome random sequencing and assembly of Haemophilus influenzae Rd";
RL   Science 269(5223):496-512(1995).
XX
RN   [2]
RP   1-13828
RX   MEDLINE; 96398784.
RA   Tatusov R.L., Mushegian A.R., Bork P., Brown N.P., Hayes W.S.,
RA   Borodovsky M., Rudd K.E., Koonin E.V.;
RT   "Metabolism and evolution of Haemophilus influenzae deduced from a
RT   whole-genome comparison with Escherichia coli";
RL   Curr. Biol. 6(3):279-291(1996).
XX
RN   [3]
RP   1-13828
RA   White O., Clayton R.A., Kerlavage A.R., Fleischmann R.D.;
RT   ;
RL   Submitted (25-JUL-1995) to the EMBL/GenBank/DDBJ databases.
RL   The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD
RL   20850, USA
XX
RN   [4]
RP   1-13828
RA   White O., Clayton R.A., Kerlavage A.R., Fleischmann R.D.;
RT   ;
RL   Submitted (27-SEP-1997) to the EMBL/GenBank/DDBJ databases.
RL   The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD
RL   20850, USA
XX
RN   [5]
RP   1-13828
RA   White O., Clayton R.A., Kerlavage A.R., Fleischmann R.D., Peterson J.,
RA   Hickey E., Dodson R., Gwinn M.;
RT   ;
RL   Submitted (28-MAY-1998) to the EMBL/GenBank/DDBJ databases.
RL   The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD
RL   20850, USA
XX
DR   SWISS-PROT; O86225; Y48A_HAEIN.
DR   SWISS-PROT; P43714; ATPA_HAEIN.
DR   SWISS-PROT; P43715; ATPB_HAEIN.
DR   SWISS-PROT; P43716; ATPG_HAEIN.
DR   SWISS-PROT; P43717; ATPD_HAEIN.
DR   SWISS-PROT; P43718; ATPE_HAEIN.
DR   SWISS-PROT; P43719; ATP6_HAEIN.
DR   SWISS-PROT; P43720; ATPF_HAEIN.
DR   SWISS-PROT; P43721; ATPL_HAEIN.
DR   SWISS-PROT; P44002; Y476_HAEIN.
DR   SWISS-PROT; P44327; HIS7_HAEIN.
DR   SWISS-PROT; P44340; HIS5_HAEIN.
DR   SWISS-PROT; P44423; HIS8_HAEIN.
DR   SWISS-PROT; P44434; HIS2_HAEIN.
DR   SWISS-PROT; P44435; HIS4_HAEIN.
DR   SWISS-PROT; P44436; HIS6_HAEIN.
DR   SWISS-PROT; P44727; TYRP_HAEIN.
XX

FH   Key             Location/Qualifiers
FH
FT   source          1..13828
FT                   /db_xref="taxon:71421"
FT                   /organism="Haemophilus influenzae Rd"
FT   CDS             65..1168
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P44423"
FT                   /note="similar to SP:P06986 GB:U02071 GB:X03416 PID:41695
FT                   PID:41710 percent identity: 60.40; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0470"
FT                   /product="histidinol-phosphate aminotransferase (hisC)"
FT                   /protein_id="AAC22129.1"
FT                   /translation="MNDYNSHILQWKKTIMTITTLSRQNIQALTPYQSARKLGGNGTIW
FT                   LNANEYPTSPEFQLSGKDLNRYPEPQPQRVVQAYANYAGVSTENVLVTRGGDEGIELII
FT                   HTFCEPKQDAILFCPPTYGMYAVSAETAGVLSKTVPLTDDFQLNLPEIKNHLNDVKVVF
FT                   VCSPNNPTGNLLKQSDILDLLQITAGKAIVYVDEAYIEFCPEASVINLLKNYPHLAIIR
FT                   TLSKAFALAGLRCGFVIANPELIDILSKVIAPYPIPVPSADLAEQALRPSNIATVQALT
FT                   QELLSNRQWLAKALLVLHQVEKVYESEANYLLIKCQNGQAVFKALWEQGIILRDQNKTL
FT                   HLQNCIRITVGTRNECEKVVEAIKEVK"
FT   CDS             1280..2368
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P44327"
FT                   /note="similar to SP:P06987 GB:X03416 PID:41696 PID:41711
FT                   GB:U00096 percent identity: 64.97; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0471"
FT                   /product="imidazoleglycerol-phosphate dehydratase /
FT                   histidinol-phosphatase (hisB)"
FT                   /protein_id="AAC22130.1"
FT                   /translation="MQPTLFIDRDGTLIDEPKTDFQIDSLEKLKLEPKVIPALLRLKAK
FT                   YRFVIVSNQDGLGTDAFPQTDFDKPHNVMMALFESQGITFDEVLICPHKPEENCLCRKP
FT                   KIKLLDHYIRKNLFDIDRSFVIGDRETDVQLAENLGIRAIQYDPQKMNWDLIAEKLLGE
FT                   TVTNCGKRPPRFAEVIRQTKETDIKVQVWLDEAGVNEIKTGVGFFDHMLDQIATHGGFR
FT                   MNVQCKGDLWIDEHHTVEDTALALGQALKQAVGDKRGIARFGFVLPMDECKAECALDLS
FT                   GRPWIKFNACFKRDKVGDFSTELTEHFFQSLAFSMLATLHLNVTGNNDHHKIESLFKAF
FT                   GRTLRQAIRIEGNEMPSSKGVL"

FT   CDS             2434..3033
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P44340"
FT                   /note="similar to SP:P10375 PID:41712 GB:U00096 PID:1736701
FT                   PID:1736710 percent identity: 55.90; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0472"
FT                   /product="amidotransferase (hisH)"
FT                   /protein_id="AAC22131.1"
FT                   /translation="MINITIIDTGCANLSSVKFAFDRLGYNTEITFDLNKIKSADKLIL
FT                   PGVGTANAAMYNLQERQLIETIQNLTQPVLGICLGMQLMTEFSEEGNVPTLNLISGKTN
FT                   RIPDTGLPLPQMGWNRVQFVKNCPLFDGIVQNSHFYFVHSYAVSPNEHSVAISNYGVNF
FT                   SAAIAKENFYGVQFHPERSGKNGALLLKNFVEKVPF"
FT   CDS             3069..3818
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P44435"
FT                   /note="similar to SP:P10371 PID:41713 GB:U00096 PID:1736702
FT                   PID:1736711 percent identity: 62.86; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0473"
FT                   /product="phosphoribosylformimino-5-aminoimidazole
FT                   carboxamide ribotide isomerase (hisA)"
FT                   /protein_id="AAC22132.1"
FT                   /translation="MKQSIIIPALDLINGQVVRLHQGDYAKQTTYSDNPIKQFDNYVRQ
FT                   GAKQLHLVDLTGAKNPQSRQTALIGKIVEATQCKVQVGGGIRTEQDVADLLAVGANRVV
FT                   IGSTAVTHRSMVKNWFIKYGAEKFVLALDVNINASGQKIVAISGWQEESGVLLETLIED
FT                   FQTVGLQQVLCTDISRDGTLTGSNIGLYQEICEKYPPIQFQSSGGIGSLADIEALKGTG
FT                   VSGVIVGRALLEGKFTLSEAIKCWQNG"
FT   CDS             3800..4576
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P44436"
FT                   /note="similar to SP:P10373 PID:41714 PID:598474 GB:U00096
FT                   PID:1736704 percent identity: 82.03; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0474"
FT                   /product="hisF cyclase (hisF)"
FT                   /protein_id="AAC22133.1"
FT                   /translation="MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAQRYAQEGAD
FT                   ELVFYDITASSDGRTVDKSWVERIAQVIDIPFCVAGGIKTIEDAEKLFAFGADKISINS
FT                   PALADPTLISRLADRFGVQAIVVGIDSWFEQETGKYWVNQYTGDETRTRQTHWQLLDWV
FT                   KEVQQCGAGEIVLNMMNQDGLRNGYDLAQLKLVRGVCRVPLIASGGAGKMVHFRDAFIE
FT                   AKVDGALAASVFHKQIIEIGELKSYLVQSAIEIRSE"
FT   CDS             4576..5241
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P44434"
FT                   /note="similar to SP:P06989 GB:D43637 GB:U02072 GB:X03974
FT                   PID:41700 percent identity: 61.22; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0475"
FT                   /product="phosphoribosyl-AMP cyclohydrolase /
FT                   phosphoribosyl-ATP pyrophosphohydrolase (hisIE)"
FT                   /protein_id="AAC22134.1"
FT                   /translation="MNITKIDWQKVNGLLPVIIQNISTREVLMLGYMNEEALTKTIKER
FT                   KVTFFSRTKQRLWTKGEISGNFLNVEEMSLDCDNDTLLILVDPIGATCHTGEYSCFHQF
FT                   TSPQSENKKQQFANWAWFIKLEQHLKEKKNADPSNSYTATLHAKGTKKIAQKVGEEGVE
FT                   TALAAVAQDKAEVISEATDLVYHLTVLLHNQDLQWYEIIAKLQERHQGIGLHPEGGNK"
FT   CDS             5323..5475
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P44002"
FT                   /note="hypothetical protein; identified by GeneMark;
FT                   putative"
FT                   /transl_table=11
FT                   /gene="HI0476"
FT                   /product="H. influenzae predicted coding region HI0476"
FT                   /protein_id="AAC22144.1"
FT                   /translation="MVIKCIDKQQNLGNIILFLLLKQQYSKEDSKKFTIYKFYLQTVNY
FT                   TIQLS"
FT   CDS             5505..6707
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P44727"
FT                   /note="similar to GB:M23240 SP:P18199 PID:148089 GB:U00096
FT                   PID:1736569 percent identity: 47.24; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0477"
FT                   /product="tyrosine-specific transport protein (tyrP)"
FT                   /protein_id="AAC22135.1"
FT                   /translation="MNKTVGSTLLVAGTMIGAGMLAMPLTSAGIGFGFTLVLLLGLWAL
FT                   LTFSALLFVELYQTAESDAGIGTLAEQYFGKTGRIIATAVLIIFLYALIAAYISGGGSL
FT                   LKDLLPESFGDKVSVLLFTVIFGSFIVIGTHSVDKINRVLFFVMLAAFAVVLSLMLPEI
FT                   KFDNLMATPIDKALIISASPVFFTAFGFHGSIPSLNKYLDGNVKALRFSILVGSAITLC
FT                   AYILWQLSTHGLLTQNEFLQILKEDATLNGLVKATFAITGSNVIASAVKLFSTLALITS
FT                   FLGVGLGLLECIEDLLKRSFNVTAGRISLGLLTFIPPLVFALFYPEGFILALGYAGQMF
FT                   AFYAVVLPVSLVWKARRAHANLPYKVWGGNLTLIIVLVLGVLITSIPFAIRAGYLPFVV
FT                   G"
FT   CDS             complement(6893..7321)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P43718"
FT                   /note="similar to GB:L10328 SP:P00832 GB:M25464 GB:V00267
FT                   GB:V00311 percent identity: 59.56; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0478"
FT                   /product="ATP synthase F1, subunit epsilon (atpC)"
FT                   /protein_id="AAC22136.1"
FT                   /translation="MATFNLTIVSAEQKIFEGEVKQIQVTGVEGELGILPGHTPLLTAI
FT                   KPGIVKFTLKDGNEEVIYVSGGFLEVQPNIVTVLADIAIRGSELDADRIHEAKRKAEEN
FT                   IVSRGSDADHDLLVAKLSKELAKLRAYELTEKLLKTRR"
FT   CDS             complement(7351..8724)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P43715"
FT                   /note="similar to GB:L10328 SP:P00824 GB:M25464 GB:V00267
FT                   GB:V00311 percent identity: 89.30; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0479"
FT                   /product="ATP synthase F1, subunit beta (atpD)"
FT                   /protein_id="AAC22137.1"
FT                   /translation="MSAGKIVQIIGAVIDVEFPQDAVPKVYDALKVESGLTLEVQQQLG
FT                   GGVVRCIALGTSDGLKRGLKVENTNNPIQVPVGTKTLGRIMNVLGEPIDEQGAIGEEER
FT                   WAIHRSAPSYEEQSNSTELLETGIKVIDLICPFAKGGKVGLFGGAGVGKTVNMMELIRN
FT                   IAIEHSGYSVFAGVGERTREGNDFYHEMKDSNVLDKVSLVYGQMNEPPGNRLRVALTGL
FT                   TMAEKFRDEGRDVLFFVDNIYRYTLAGTEVSALLGRMPSAVGYQPTLAEEMGVLQERIT
FT                   STKTGSITSVQAVYVPADDLTDPSPATTFAHLDSTVVLSRQIASLGIYPAVDPLDSTSR
FT                   QLDPLVVGQEHYDVARGVQGILQRYKELKDIIAILGMDELSEEDKLVVARARKIERFLS
FT                   QPFFVAEVFTGSPGKYVTLKDTIRGFKGILDGEYDHIPEQAFYMVGSIDEVLEKAKNM"
FT   CDS             complement(8741..9610)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P43716"
FT                   /note="similar to GB:L10328 SP:P00837 GB:M25464 GB:V00267
FT                   PID:146324 percent identity: 65.28; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0480"
FT                   /product="ATP synthase F1, subunit gamma (atpG)"
FT                   /protein_id="AAC22138.1"
FT                   /translation="MAGAKEIKTKIASVQSTQKITKAMEMVATSKMRKTQDRMAASRPY
FT                   SETIRNVISHVSKASIGYKHPFLVEREVKKIGILVISTDRGMCGGLNVNLFKTTLNQIK
FT                   NWKEQNISTDLGLIGSKGISFFRSFGFNIKGQLSGLGDTPALEELIGVANTMFDAYRNG
FT                   EIDAVYIAYNKFVNTMSQKPVVQQLVPLPESKDDHLNERQQTWDYLYEPEPKVLLDSLL
FT                   VRYLESQIYQAVVDNVASEQAARMVAMKAATDNAGNLINDLRLVYNKARQASITNELNE
FT                   IVAGAAAI"
FT   CDS             complement(9626..11167)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P43714"
FT                   /note="similar to GB:L10328 SP:P00822 GB:M25464 GB:V00265
FT                   GB:V00266 percent identity: 86.55; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0481"
FT                   /product="ATP synthase F1, subunit alpha (atpA)"
FT                   /protein_id="AAC22139.1"
FT                   /translation="MQLNSTEISELIKKRIAQFDVVSEARNTGTIVSVSDGIIRIHGLS
FT                   DVMQGEMIALPGNRYAMALNLERDSVGAVVMGPYADLAEGMEVQCTGRILEVPVGRGLL
FT                   GRVVNTLGQPIDGKGEIENDGFSPVEVIAPGVIDRRSVDQPVQTGYKAVDSMVPIGRGQ
FT                   RELIIGDRQTGKTALAIDAIINQRNSGIKCIYVAIGQKASTIANVVRKLEEHGALANTI
FT                   VVAASASESAALQYLAPYAGCAMGEYFRDRGEDALIVYDDLSKQAVAYRQISLLLRRPP
FT                   GREAYPGDVFYLHSRLLERASRVNEDYVEKFTKGEVKGKTGSLTALPIIETQAGDVSAF
FT                   VPTNVISITDGQIFLESNLFNSGIRPAVNPGISVSRVGGSAQTKVIKKLAGGIRTALAQ
FT                   YRELAAFAQFASDLDDATRKQLSHGEKVTELLKQKQFTPLSVAEQAVILFAVEFGYLDD
FT                   VELSKIASFETALLDYSNRNHAEFMQELNKTGNYNDEIKDTLKSILDGFKANSAW"
FT   CDS             complement(11180..11713)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P43717"
FT                   /note="similar to GB:L10328 SP:P00831 GB:M25464 GB:V00264
FT                   GB:V00266 percent identity: 57.95; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0482"
FT                   /product="ATP synthase F1, subunit delta (atpH)"
FT                   /protein_id="AAC22140.1"
FT                   /translation="MSELTTIARPYAKAAFDFAIEQSAVEKWTEMLGFAAAVAEDETVK
FT                   AYLSSSLSAQKLADTVISICGEQLDQYGQNLIRLMAENKRLSAIPAVFEEFKHHVEEHQ
FT                   AIAEVEVTSAQPLNATQIEKIAAAIEKRLARKVKLNCNVDNALIAGVIVRTEDFVIDGS
FT                   SRGQLTRLANELQL"
FT   CDS             complement(11726..12196)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P43720"
FT                   /note="similar to GB:L10328 SP:P00859 GB:M10422 GB:M25464
FT                   GB:V00264 percent identity: 63.46; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0483"
FT                   /product="ATP synthase F0, subunit b (atpF)"
FT                   /protein_id="AAC22141.1"
FT                   /translation="MNLNATLIGQLIAFALFVWFCMKFVWPPIINAIETRQSQIANALA
FT                   SAEAAKKEQADTKNLVEQELSAAKLQAQDILDAANKRRNEVLDEVKAEAEELKAKIIAQ
FT                   GYAEVEAERKRVQEELRLKVASLAVAGAEKIVGRSIDEAANNDIIDKLVAEL"
FT   CDS             complement(12246..12500)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P43721"
FT                   /note="similar to SP:P12991 PID:48334 percent identity:
FT                   62.65; identified by sequence similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0484"
FT                   /product="ATP synthase F0, subunit c (atpE)"
FT                   /protein_id="AAC22142.1"
FT                   /translation="METVITATIIGASILLAFAALGTAIGFAILGGKFLESSARQPELA
FT                   SSLQTKMFIVAGLLDAIAMIAVGISLLFIFANPFIGLLN"
FT   CDS             complement(12556..13344)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:P43719"
FT                   /note="similar to GB:L10328 SP:P00855 GB:M14019 GB:M25464
FT                   GB:M29174 percent identity: 58.17; identified by sequence
FT                   similarity; putative"
FT                   /transl_table=11
FT                   /gene="HI0485"
FT                   /product="ATP synthase F0, subunit a (atpB)"
FT                   /protein_id="AAC22143.1"
FT                   /translation="MSGQTTSEYISHHLSFLKTGDGFWNVHIDTLFFSILAAVIFLFVF
FT                   SRVGKKATTGVPGKMQCLVEIWEWNGIVKENFHGPRNVVAPLALTIFCWVFIMNAID
FT                   LIPVDFLPQFAGLFGIHYLRAVPTADISATLGMSICVFFLILFYTIKSKGFKGLVKEYT
FT                   LHPFNHWAFIPVNFILETVTLLAKPISLAFRLFGNMYAGELIFILIAVMYSANT^IAIAAL
FT                   GIPLHLAWAIFHILVITLQAFIFMMLTWYLSIAYNKADH"
FT   CDS             complement(13381..13755)
FT                   /codon_start=1
FT                   /db_xref="SWISS-PROT:O86225"
FT                   /note="Brute Force ORF; identified by GeneMark; putative"
FT                   /transl_table=11
FT                   /gene="HI0485.1"
FT                   /product="H. influenzae predicted coding region HI0485.1"
FT                   /protein_id="AAC22145.1"
FT                   /translation="MSRILSHAKKNYRKAIVIESLLLWFYLLIYGWQRQSAVDFSYGF
FT                   LSAFLPFCTFIFIIFYRKQNFSTKLTALYRAEAIKFILTMVFIIIAIKWLFVINFIAFF
FT                   VGFLLALVLNNIIPLILNKI"
XX 

SQ Sequence 13828 BP; 4368 A; 2718 C; 2562 G; 4180 T; 0 other; 

ttaattatag atctaagcac gacacgtaag cagtcaaatt ttcatacaac aatacgaaat 60 
atgtatgaat gattacaatt cgcacatttt acaatggaag aaaacaataa tgacaatcac 120 
aactttatcc cgacaaaata tacaagcctt aacgccatat caatccgctc gaaaactcgg 180 
cggcaatggt acgatttggc tgaatgcaaa tgaatatcca acttcgcctg aatttcagct 240 



gcaagctgtg tttaaggcat tgtgggaaca aggaattatt ttaagagacc agaataagac 1080 

attgcattta caaaattgta ttcgcattac tgtgggaaca aggaatgagt gtgagaaagt 1140 

tgttgaagca attaaggagg taaaatgaac cacctcctat ttaagtcaca gagcgattcg 1200 

agattattta gttttatact gtttgtataa agcaaatgat aaacctcata ccattcatat 1260 

aacatacacg gagaaaaata tgcaacctac tctttttatc gaccgagacg gcacattaat 1320 

tgatgaacca aaaactgatt tccaaattga cagtttagag aaactaaaat tagaaccgaa 1380 

agtgattccc gcactcctca gactgaaagc gaaataccgt tttgtgatag tttctaatca 1440 

agatgggctt ggtacggatg cttttccgca aacagatttt gataagcccc acaatgtaat 1500 

gatggcatta tttgaatctc agggcattac ctttgatgaa gtactcattt gcccgcataa 1560 

gcccgaagaa aattgtctat gtcgcaaacc aaaaattaag ttgcttgatc actatattcg 1620 

taaaaattta ttcgatattg accgctcttt tgtcattggt gaccgagaga ccgatgtgca 1680 

attagcagaa aatctaggta ttcgagctat tcaatacgat ccacagaaaa tgaattggga 1740 

tctgattgca gaaaaattgt tgggggaaac ggtcacaaac tgtggcaaac gtccgccacg 1800 

ttttgcagaa gtaattcgtc aaactaaaga gaccgatatt aaagtgcagg tttggctaga 1860 

tgaagctggt gtaaatgaaa tcaaaaccgg cgttggtttt tttgaccata tgctcgatca 1920 

aatcgccaca cacggtgggt ttagaatgaa cgtgcagtgt aaaggcgatt tatggattga 1980 

cgaacatcac acggtggagg acaccgctct cgcccttgga caagcactga aacaagccgt 2040 

tggtgacaaa cgtggtattg cccgtttcgg ctttgtgctt ccaatggacg aatgtaaagc 2100 

agaatgtgct ttagatttat cgggcagacc ttggatcaaa ttcaatgcct gctttaaacg 2160 

cgataaagtg ggcgatttta gtacggaact cactgaacat ttctttcaat ctttggcatt 2220 

ttcaatgctt gccacgcttc atttaaatgt cacaggtaat aacgatcatc ataaaataga 2280 

gagcttattt aaagcctttg ggagaacatt aagacaggcg attcggattg aaggaaatga 2340 

aatgccaagt tcaaaaggag ttttatagtt gaaagagcgg ttaatttttg tcgtattttt 2400 

gcaaaattcg accgattgta aaaggataaa aaaatgataa acatcacgat tatagacaca 2460 

ggttgtgcca acctttcttc tgtaaaattc gcctttgacc gattaggtta caataccgag 2520 

atcacttttg atctcaataa aatcaaatct gccgataagc tgattttacc tggcgtaggc 2580 

actgcaaatg cagcaatgta taatttacaa gaacgtcaat tgatcgagac cattcaaaat 2640 

ctaacacaac cagtgctagg aatttgtttg ggtatgcagt taatgaccga attttccgaa 2700 

gagggtaatg tgccgacctt aaacctaata agcggaaaaa ctaatcgaat tcccgatacg 2760 
ggtttaccgt tacctcaaat gggatggaat cgcgtgcaat ttgtaaaaaa ctgtccgtta 2820

tttgatggta ttgtgcaaaa tagtcacttc tattttgtgc atagttatgc agttagcccg 2880 

aatgaacatt cggtggcaat aagcaattat ggtgtgaatt tttctgctgc gatagcgaaa 2940 

gagaattttt acggtgtaca atttcatcct gaacgttcgg gtaaaaatgg tgcgttgttg 3000 

ctaaaaaatt ttgtggaaaa agtaccgttt taatcactag atcatatagt catacaggat 3060 

caaacacaat gaaacaatct attattattc ctgctcttga tcttatcaat ggtcaagttg 3120 

tgcggttaca ccaaggcgat tatgccaagc aaactaccta ttctgacaat ccaattaagc 3180 

aatttgataa ttatgtacgt caaggagcaa aacaactaca tctagtcgat ttaacgggag 3240' 

cgaaaaatcc ccagtcaaga caaaccgcac ttattggaaa aatcgttgaa gcaacacaat 3300 

gtaaagtaca agtgggagga ggcattcgta cagagcaaga tgtggcagat ttattggctg 3360 

tgggtgcgaa tcgcgtggtg attggttcaa ccgccgtaac tcatcgttca atggtgaaaa 3420 

actggttcat caaatatggt gcggaaaaat ttgttctagc attagacgtg aatattaatg 3480 

caagcggtca gaaaatcgtt gcgattagcg gttggcagga agaaagtggt gtactattag 3540 

aaacgttgat cgaagatttc caaaccgttg gcttacaaca agttttatgt accgatattt 3600 

cacgcgatgg caccttgaca ggttcaaata ttggacttta tcaagaaatt tgtgagaaat 3660 

atccacctat tcagtttcaa tcttctggag ggattggttc actggctgat attgaggctc 3720 

taaaaggcac aggtgtgtca ggtgtaattg taggtcgagc tttattagaa ggaaaattta 3780 

cgctttcgga ggcaatcaaa tgttggcaaa acggataatt ccttgtttgg atgtacgtga 3840 

tggacaggtg gtaaaagggg tacaattccg taaccacgaa attattggcg atattgtgcc 3900 

acttgctcaa cgctatgcgc aagaaggagc tgatgaattg gtattttatg atattacggc 3960 

ttcatcagac ggtcgtaccg tggataaaag ttgggtagaa cgaatcgctc aggtaattga 4020 

tattccgttt tgcgtagcag gtgggattaa aaccattgaa gatgcggaaa aattgtttgc 4080 

atttggggcg gataaaattt cgattaattc ccctgctctc gctgatccaa cgctcatttc 4140 

tcgcttagca gatcgttttg gcgtacaagc gattgtagta ggtattgata gttggtttga 4200 

acaggaaaca ggtaagtatt gggtaaacca atatacaggc gatgaaacac gtacccgaca 4260 

aactcattgg cagttgttag attgggtaaa agaagtacaa caatgcggtg caggagaaat 4320 

cgtgctaaat atgatgaatc aagatggttt acgaaacggg tacgatttag cccagctcaa 4380 

gctggtgcga ggagtatgta gagtacctct aattgcatct ggcggtgcgg gtaaaatggt 4440 

acattttcgt gatgctttta ttgaagcaaa agtagatggg gcattagctg ccagcgtatt 4500 

ccataaacag attattgaaa ttggcgagtt aaaatcctac cttgtgcaat cagccatcga 4560 

gattcggagt gaataatgaa tattactaaa atagactggc aaaaagttaa tggtcttctt 4620 

cctgttatta ttcaaaatat aagtactcgt gaggtactaa tgcttggcta tatgaatgaa 4680 

gaagccctta caaaaaccat caaagaacgt aaagtaactt tcttctctcg taccaaacaa 4740 

cgattatgga caaaaggaga aatctcgggc aattttttaa atgtggaaga aatgagctta 4800 

gattgcgata acgacacttt gctcatttta gttgatccca ttggggcaac ttgccacaca 4860 

ggtgaataca gctgcttcca tcaatttacc tctccacaat cagagaacaa gaagcaacaa 4920 

tttgcaaatt gggcgtggtt tataaaatta gaacaacatt taaaagaaaa gaaaaatgca 4980 

gatccttcta attcttatac tgcaacatta cacgcaaaag gtaccaaaaa aatcgcccaa 5040 

aaagtggggg aagaaggtgt ggaaactgcc ttggctgcgg tcgcacaaga taaagcggaa 5100 

gtcatttctg aagctacaga tttggtctat cacttaaccg tattattaca caatcaagat 5160 

cttcagtggt acgaaattat cgcaaaacta caagaacgcc atcaagggat tggattacat 5220 

ccagaaggag gaaataaata agaaaattgt caggcaatat taagtgagca attagaagat 5280 

ttgtttgggc aattcaatat catcgtaaaa tacagtgaaa ctatggtaat taaatgtatt 5340 

gataaacaac aaaatttagg aaatatcatt ttattccttc ttcttaagca acaatatagc 5400 

aaggaagata gtaaaaaatt cacaatttat aaattttatt tacagactgt aaattataca 5460 

atacaattat cgtaaatttt ttattacatc ataaggaaac aaaaatgaac aaaactgtag 5520 

gcagcacact tcttgttgct ggtacgatga ttggcgcggg aatgttggct atgccactca 5580 

cttccgctgg cattggcttc ggctttactt tagtcttgtt attggggctt tgggcattat 5640 

taacttttag cgcactttta ttcgttgaac tctatcaaac tgcggaaagt gatgcaggta 5700 

tcggcacact tgcagaacaa tattttggta aaacaggacg cattattgcc acagccgtgc 5760

taattatttt cttatacgca ttaattgccg cttatatcag cggtggcggt tcactactaa 5820 

aagatttatt accagaaagt tttggcgata aagttagcgt cttattattt accgtgattt 5880 

tcggttcatt tattgtcatc ggcacacata gcgtagataa aattaatcgc gtgttatttt 5940 

tcgttatgct tgctgccttt gccgtggtgt taagcttaat gttgccagaa atcaaatttg 6000 

ataacttaat ggcgacgcca attgataagg ctttaattat ctctgcaagc cctgtatttt 6060 

tcacggcatt tggtttccac ggttccattc caagtttaaa taaatactta gatggcaatg 6120 

taaaagcatt gcgcttctct attttggtcg gctctgccat aacactttgt gcttatattc 6180 
tatggcaact atccacgcac ggcttgctta cacaaaatga atttttacaa atcttaaaag 6240

aagatgcgac tttaaatggc ttagtaaaag caaccttcgc catcactgga agcaatgtga 6300 

tcgcaagtgc ggtaaaatta ttttctacat tagcacttat cacttctttc ttaggtgtgg 6360 

gattaggatt attagaatgt attgaagatt tactaaaacg ttcatttaac gttacagcag 6420 

gtagaatttc acttggttta ctgactttta ttccaccact tgtttttgca ttattttacc 6480 

cagaaggttt tattttagca ctaggctacg caggtcaaat gtttgcattc tacgccgtag 6540 

tattgccagt tagccttgtt tggaaagctc gtcgtgccca cgctaatttg ccatacaaag 6600 

tgtggggagg aaatctgact ttaattatcg tattggtatt aggcgtgctc attacatcca 6660 
ttccatttgc gattcgtgcg ggttatttac cctttgttgt aggttaaaaa caaactaaaa 6720

atttaccgca ctttttatct aaaaaagagt aaatattgtc ctaattaatt tagataaaaa 6780 

tttcagacaa aacttatcat taatcttata caacaaaaag gctaagttct aaaattatag 6840 

aacttagccc ttaaactaaa tgaagttcac ttcatctata ttatactttc aattatcgtc 6900 

ttgttttcaa tagtttttca gtgagttcat aagctcgaag ttttgctaac tctttagaaa 6960 

gttttgcaac aagtaagtcg tgatctgcat cagatccgcg agatactata ttttcttctg 7020 

ctttacgttt tgcttcgtga atacgatctg catctaattc actacctcga atagcaatat 7080 

ctgctaatac tgtcacaata tttggttgaa cttctaaaaa accaccagaa acatagataa 7140 

cttcttcatt cccatcttta agagtaaatt taacaatccc aggcttaatt gctgttagca 7200 

atggcgtatg tccgggcaaa ataccaagct ccccttccac accagttacc tgaatttgtt 7260 

ttacttcacc ttcaaagatt ttctgctctg cgcttactat tgttaaatta aatgtcgcca 7320 

ttttgttctc cttcaaatga cttgaagtga ttacatattt ttggcttttt ctaacacttc 7380 

atcgattgaa ccaaccatat agaacgcttg ttcaggaata tggtcatatt cgccatctaa 7440 

gatacctttg aagccacgga tggtgtcttt taatgtcacg tatttacctg gtgaacctgt 7500 

gaagacttct gcaacaaaga atggttgtga taagaaacgt tcaattttac gtgcacgtgc 7560 

taccactagt ttatcttctt cagataattc atccatacca agaattgcga taatatcttt 7620 

caattcttta taacgttgta aaataccttg tacaccacga gcaacatcat aatgttcttg 7680 

accaacaaca agcgggtcta gctgacgtga agttgaatct aatggatcaa ctgcagggta 7740 

aatacctaaa gatgcgattt gacgacttaa tacaacagtt gagtctaaat gtgcgaaagt 7800 

tgttgctgga gatgggtcag ttaagtcatc cgctggtacg tacaccgctt gcacagaagt 7860 

gatagaacct gttttggttg aagtgatacg ttcttgtaac acacccattt cttctgccaa 7920 

tgtcggttgg taacctaccg cggatggcat acgacctaat aacgcagata cttccgtacc 7980 

agcaagggta taacgataga tattatccac aaagaataat acatctcgac cttcatcacg 8040 

gaatttttca gccatagtta aaccagtcaa tgcaacacgt aaacggttac ctggtggctc 8100 

gttcatttga ccataaacca aagatacttt atctaatacg ttagaatctt tcatttcatg 8160 

atagaagtca ttaccttcac gagtacgttc gcctacaccc gcaaatacag agtaacctga 8220 

atgctcaatc gcaatattac gaatcaactc catcatattt acggttttac ctacacctgc 8280 

accgccgaat aaaccaactt taccaccttt tgcgaatgga caaattaagt cgatcacttt 8340 

gataccagtt tctaataatt ccgtactgtt tgattgttct tcatagctcg gtgccgaacg 8400 

gtggatagcc caacgctctt cttcaccaat tgctccttgt tcgtcaattg gttcgcccaa 8460 

cacattcata atacgaccaa gtgtttttgt gcctactggt acttgaattg ggttatttgt 8520 

gttttctact tttaagccac gttttaaacc atcagatgta cccaatgcaa tacaacgaac 8580 

tactccgcca cctaattgtt gttgcacctc aagtgttaaa cctgattcaa cttttaatgc 8640 

atcgtaaact tttggcactg catcttgtgg aaattcaacg tcaatcaccg caccgatgat 8700 

ttgtacaatt tttcctgctg acattaccgt tcctcttttc ttaaatcgcc gccgcaccgg 8760 

ctacgatttc attcaattca tttgtgatac ttgcttgacg agctttgttg tacaccaacc 8820 

gcagatcatt aattaaatta cctgcattat ccgttgctgc tttcattgct accattcgag 8880 

ccgcttgttc tgaagctaca ttatctacaa ccgcttgata aatttgggac tctaaataac 8940 

gaactaaaag gctatctaat agtacttttg gttctggctc ataaagataa tcccaagtct 9000 

gttgtctttc atttaaatga tcgtctttag attctggtaa aggaactaat tgttgtacaa 9060 

caggcttttg cgacatcgta ttaacaaatt tattgtatgc aatataaact gcatcaattt 9120 

caccattacg ataagcatca aacattgtat ttgccacacc aattaattct tctagagcgg 9180 

gcgtatcgcc taaaccagaa agctgacctt tgatattaaa tccaaaggaa cggaaaaaac 9240 

taatcccttt tgaacctatt aagcccaaat ctgtagaaat attttgttct ttccaatttt 9300 

ttatttggtt aagtgtggtt ttgaataaat taacatttaa cccaccacac atcccacgat 9360 

ctgttgaaat aaccaagata ccgattttct tcacttcgcg ctcaactaag aacggatgtt 9420 

tataaccgat acttgcctta gacacatgac taataacgtt acggatagtt tcagaatacg 9480 

gacgagatgc agccatacga tcctgcgttt tacgcatttt cgaggttgcc accatttcca 9540 

ttgccttagt gattttttgt gtactttgta cactggcaat tttggttttt atctcttttg 9600 

cacctgccat cttatctctc cgttactacc aagcactatt cgctttaaaa ccatctaaaa 9660 

tactttttaa agtatctttg atttcatcat tatagttacc ggttttatta agctcttgca 9720 

taaattcagc gtgattacgg ttagaataat ctaaaagtgc ggtctcaaaa cttgcaattt 9780 

ttgataattc cacgtcatcc aaataaccaa attcaacagc aaacaatata accgcttgtt 9840 

ctgcaacaga taatggcgtg aattgttttt gttttaataa ttcagttact ttttcaccgt 9900 

gagaaagttg cttacgagtc gcatcatcta gatctgatgc aaactgtgca aacgctgcta 9960 

attcacgata ttgagctagc gcggtacgaa taccacccgc taatttctta ataactttag 10020 

tttgcgctga accaccaaca cgagataccg aaatacctgg gtttaccgca ggacgaatac 10080 

ctgagttaaa caagttagat tctaagaaga tctgaccatc ggtaatagaa attacgttgg 10140 

ttggaacaaa cgcagatacg tcaccagctt gggtttcaat aatcggaagt gcggttaaag 10200 

aacctgtttt tccttttact tcgcctttag tgaatttttc tacataatct tcatttacac 10260 

gagaagcacg ttctagtaaa cgagaatgta aatagaacac atcacctgga taggcttcac 10320 

gacctggtgg acgacgtaat aataatgaaa tttgacgata agctacggct tgttttgata 10380 

aatcatcata aacaattaac gcatcttcgc cgcgatcacg gaaatattcc cccatcgcac 10440 

aaccagcata aggggctaaa tattgtaacg ctgctgattc agatgcagat gcagctacca 10500 

cgatagtatt tgcaagcgca ccatgttctt ctaatttacg cacgacattt gcaatagtag 10560 
aggctttttg accaatcgca acatagatac atttaatacc tgaattacgt tgattaataa 10620
tagcgtcaat tgctaacgca gttttacccg tttgacggtc accgatgatt aactcacgct 10680
gaccacgacc aatcggtacc atagaatcga cagctttata accagtttga acaggttgat 10740
caacagaacg acgatcgata acacctggcg caatcacttc tactggtgag aaaccatcat 10800
tttcaatttc gcctttacca tcgattggct gaccaagcgt attaactaca cgacctaata 10860
aaccacgacc aactggtact tcaagaatac gacctgtaca ttgaacttcc ataccttccg 10920
ctaaatctgc gtaaggtccc ataactacag caccaacaga atcgcgttca aggttaagtg 10980
ccatcgcata acgattacct ggtaaggcga tcatttcacc ttgcatcaca tcgcttaaac 11040
cgtggatgcg aataatacca tcgcttacag aaacaattgt ccctgtatta cgcgcttcac 11100
tgaccacgtc gaattgagcg atgcgttttt taatcaattc actaatttca gttgaattta 11160
gttgcatctt gtattcctct tataattgca actcgtttgc aagacgagta agttgcccac 11220
ggctacttcc gtcaatcaca aagtcttcag tacgaacaat cacaccagca ataagtgcgt 11280
tatccacgtt gcaatttaat ttcactttgc gagctaatct tttttccatt gcagctgcaa 11340
ttttttcgat ttgtgttgca ttcaatggtt gcgcagatgt tacttcaact tctgcaatag 11400
cttggtgttc ctctacgtga tgtttaaatt cttcaaacac cgcaggaata gcactcaaac 11460
gcttattttc agccattaac cgaataagat tttgcccata ttgatccaat tgttcgccac 11520
aaatagaaat tactgtatca gctaatttct gtgcagaaag agaactactt aagtaagctt 11580
ttaccgtttc atcttcagct acggcagcag caaaacctaa catttcagtc catttttcga 11640
ccgcactttg ttcaatggca aagtcgaatg cagcttttgc ataagggcga gctattgtag 11700
ttaattctga cataagctaa gcctcttata actctgcaac taatttatca ataatgtcat 11760
tgtttgccgc ttcatcaata gaacgaccca caattttctc agcaccagcc actgccaatg 11820
aagctacttt aagacgtaat tcttcttgaa cacgtttacg ttctgcttct acttcagcat 11880
aaccttgtgc aataattttt gcttttagtt cttcggcttc tgccttcact tcgtctaata 11940
cttcattgcg acgtttattc gcagcatcta aaatgtcttg agcttgtaat tttgcagctg 12000
aaagttcttg ttcaacaaga tttttcgtat ctgcttgctc ttttttagct gcttcagctg 12060
acgctaaagc gttcgcaatt tggctttgac gtgtttcaat cgcattaata attggtggcc 12120
aaacaaactt catgcaaaac cacacgaaaa gtgcgaatgc gataagttga ccaatcaatg 12180
ttgcatttaa attcacaacg tcctccttac tatgcttgtt ttacgttgat aagcaaataa 12240
ggtgattagt ttaataaacc gatgaatggg tttgcgaaaa tgaaaagtaa tgaaatacca 12300
acagcaatca ttgcaatcgc atctaaaaga ccagctacga taaacatttt agtttgtaag 12360
ctagatgcta attctggttg gcgagctgat gattctaaga atttaccacc tagaatagca 12420
aagccaattg cagtacctaa tgcagcaaat gcaagaagaa ttgatgcacc gatgattgtt 12480
gctgtaatta cagtttccat aattttctcc aatagaagtt aagttttagc cctagagccg 12540
gttaataaaa aacgattaat gatctgcttt gttataagca atacttaaat aaacaaccgt 12600
caacatcata aagatgaaag cctgcaaggt aataaccaaa atatggaaaa tagcccaagc 12660
aagatgtaac ggaatcccta atgctgcaat ggccatatta gcggaataca ttacagcgat 12720
aagaataaag atcaactccc ctgcatacat gttaccgaat aaacggaagg caagtgaaat 12780
tggtttagcc aataaagtca ccgtttcaag aataaaatta acaggaataa acgcccaatg 12840
attgaatgga tgaagcgtat attctttaac taaacctttg aatcctttag attttattgt 12900
ataaaaaaga attaaaaaga aaacacaaat ggacataccc aaagttgcgc taatatctgc 12960
tgtcggcact gcacgaagat aatgaatacc gaacaaacct gcaaattgag gtaagaaatc 13020
aacaggaatt aagtcgatag cattcatgat gaatacccaa cagaaaattg ttaatgcaag 13080
tggcgcaact acgttacgtg gaccatgaaa attttctttc acgattccat taacccattc 13140
cacaacaatt tcaactaagc attgcatttt tcctggaacg ccagttgttg cttttttccc 13200
tactcgagaa aaaacaaata ggaaaataac cgcagcaaga atagaaaaga acaacgtatc 13260
tatatgaaca ttccaaaatc catcccctgt tttgagaaag gataaatggt ggctaatgta 13320
ttccgatgtt gtttgtccag acataatcga tcctttgttc tgaagaagaa taacccaaag 13380
ttaaatttta ttgagaataa gtggaataat attattcagc actaatgcca ataaaaaacc 13440
aacaaaaaac gcaataaaat tgataacgaa taaccatttg atggcaataa tgataaaaac 13500
catcgttaaa ataaacttta ttgcttcagc acgataaagt gcggtcagtt ttgttgaaaa 13560
attttgtttt ctgtaaaaaa tgataaatat aaatgtacaa aaaggcaaaa aggcacttaa 13620
aaaaccgtag ctaaaatcta ccgcactttg gcgttgccat ccataaataa gcaaataaaa 13680
aactacaagc aaaagactct caataacaat agcttttctg tagttttttt tagcgtgaga 13740
taaaatgcgc gacatcactt ttacaaaata attttagccc gccgattata cctttgctgg 13800
ctttagattc aacttaaaaa cacataaa 13828

ID AF032869 standard; DNA; PRO; 307 BP.
XX 

AC AF032869; 
XX 

SV AF032869.1 
XX 

DT 24-NOV-1997 (Rel. 53, Created) 

DT 30-NOV-1997 (Rel. 53, Last updated, Version 3) 
XX 

DE Actinobacillus pleuropneumoniae ATP synthase gene, partial cds.

XX 

KW 

XX 

OS Actinobacillus pleuropneumoniae 

OC Bacteria; Proteobacteria; gamma subdivision; Pasteurellaceae; 

OC Actinobacillus. 

XX 

RN [1] 

RP 1-307 

RA Bacmeister C.X., Fenwick B.W.; 
RT 

RL Submitted (03-NOV-1997) to the EMBL/GenBank/DDBJ databases.

RL Diagnostic Medicine/Pathobiology, Kansas State University, 1800 Denison

RL Ave, Manhattan, KS 66506-5605, USA 

XX 

DR SPTREMBL; O31195; O31195.



XX 

FH Key Location/Qualifiers 
FH 

FT source 1..307

FT /db_xref="taxon:715" 

FT /organism="Actinobacillus pleuropneumoniae" 

FT /strain="4074" 

FT /serotype="1"

FT CDS 1..>307

FT /codon_start=1

FT /db_xref="SPTREMBL:O31195"

FT /transl_table=11

FT /product="ATP synthase"

FT /protein_id="AAB87466.1"

FT /translation="MATGKIVQIIGAVIDVEFPQDAVPKVYDALKVESGLTLEVQQQLG 
FT GGLVRCIALGTSDGLKRGLKVENTGNPIQVPVGTKTLGRIMNVLGEPIDEKGPIGEV"
XX 



SQ Sequence 307 BP; 97 A; 54 C; 79 G; 77 T; 0 other; 

atggcaacag gaaaaattgt acagattatc ggtgcggtaa tcgacgttga attcccgcaa 60 

gatgcagtac caaaagtata tgatgcctta aaagttgaat cgggtttaac ccttgaggtt 120 

caacaacaat taggtggtgg acttgtgcgt tgtatcgcat taggtacatc agacggttta 180 

aaacgcggct taaaagttga aaatacgggt aacccgattc aagtaccggt cggtacaaag 240 

actctgggtc gtattatgaa cgtattaggt gagccaatcg atgaaaaagg ccctatcggc 300 

gaagtag 307

// 